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ABSTRACT 

In the last decade polycistronic vectors have 
become essential tools for both basic science and 
gene therapy applications, in order to co-express 
heterologous polypeptides, different systenns have 
been developed from Internal Ribosome Entry Site 
(IRES) based vectors to the use of the 2A peptide. 
Unfortunately, these methods are not fully suitable 
for the efficient and reproducible modulation of the 
ratio between the proteins of interest. Here we 
describe a novel bicistronic vector type based on 
the use of alternative splicing. By modifying the 
consensus sequence that governs splicing, we 
demonstrate that the ratio between the synthesized 
proteins could easily vary from 1:10 to 10:1. 
We have established this system with luciferase 
genes and we extended its application to the 
production of recombinant monoclonal antibodies. 
We have shown that these vectors could be used 
in several typical cell lines with similar efficiencies. 
We also present an adaptation of these vectors to 
hybrid alternative splicing/I RES constructs that 
allow a ratio-controlled expression of proteins of 
interest in stably transfected cell lines. 

INTRODUCTION 

Many applications require co-expression of heterologous 
polypeptides from basic research to gene therapy experi- 
ments. In this purpose, numerous approaches have been 
developed from co-transfection with two independent 
constructs to single vectors where co-expression is 



achieved through the use of several promoters. Internal 
Ribosome Entry Sites (IRES) or Foot-and-Mouth 
Disease- Virus (FMDV)-derived 2 A peptides (1). All 
these strategies have various drawbacks but one particular 
disadvantage is that they do not allow easy, reproducible 
and great modulation of the expression ratio between 
the proteins of interest. However, in several cases, this 
property might be useful. 

One particular example is the production of recom- 
binant antibodies, which are formed by association of 
two light chains (LCs) and two heavy chains (HCs). 
Studies demonstrated that intracellular HC : LC ratio is 
of major importance regarding antibodies production 
efficiency (2,3). The optimum ratio for efficient production 
depends on many factors including the cell type used for 
expression, and whether production is performed in a 
transient or stable context (4,5), Therefore, this ratio has 
to be adaptable to allow optimal antibody production in 
any case. 

The system described in this article is based on 
alternative splicing to ensure regulated co-expression of 
two polypeptides. Alternative splicing is the mechanism 
by which different mature mRNAs can be generated 
from one pre-mRNA through the use of alternative 
splice sites (6), Splice sites define the border of an intron 
and consist of the almost invariant GU dinucleotide, 
called 5' spHce site (5'SS) and the 3' splice site (3'SS) 
that comprises three sequence elements: the branch 
point, followed by a polypyrimidine tract, and the 
terminal AG sequence. Both 5'SS and 3'SS are comprised 
within larger, less conserved consensus regions. Choice 
between alternative splice sites is regulated in many ways 
including the inherent strength of the splice sites, ix. how 
close they are from the consensus sequences (7) and the 
presence of cw-regulating elements in the surrounding 
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sequences, for example Exonic Splicing Enhancers (ESE) 
or Silencers (ESS) (8), 

In the present study we tried to evaluate if alternative 
splicing could be a valuable mechanism to co-express two 
proteins in a regulated manner through the mutation 
of splice sites. The system was tested for different 
applications including multimeric proteins expression, in 
various cell types and both in a transient or stable context. 



MATERIAL AND METHODS 

Plasmid construction 

The vector's backbone was obtained from pCRFL 
previously described (9) containing the chimeric intron of 
pRL-CMV (Promega). Luc R was polymerase chain 
reaction (PCR) amplified from pRL-CMV (Promega) 
using HA-LucR -F and -R primers (primer sequences are 
available in Supplementary Data and Table SI). Primers 
contained the following restriction sites: Avrll/BamHI 
(forward primer) and PacI/SacII/NotI (reverse primer). 
The reverse primer included the sequence of a second 
3'SS consisting in the following elements: branch point, 
pyrimidine tract and acceptor splice site sequence. These 
elements were included between the Pad and NotI 
restriction sites. Luc F was amplified from pGL3 
(Promega) using HA-LucF -F and -R primers, containing 
Pacl/Nhel and Bglll/EcoRV sites. The forward primers, 
used to amplify both fragments, contained the sequence 
coding the hemaglutinin (HA) tag in fusion with the 
luciferase open reading frames. Both PCR fragments 
were respectively digested with Avrll/PacI and Pad/ 
Bglll and cloned into the Xbal/Bglll digested pCRFL, 
Three putative alternative translation initiation codons 
CTG present in the vector 5'~UTR were mutated into 
TTG sequences using the Quikchange XL site directed 
mutagenesis kit (Stratagene) with oligonucleotides CTG- 
mutl to CTG-mut3 -F and -R designed according to the 
manufacturer's instructions. The resulting plasmid, further 
called pVl, was checked by sequence analysis. Plasmid 
expressing the LCs and HCs of the anti-N-VEGF 
antibody further called pAl, was obtained by replacing 
HA-LucR and HA-LucF coding cistrons of pVl by the 
PCR amplified LCs and HCs respectively using BamHI/ 
Xbal and Nhel/EcoRV restriction sites. The sequences of 
the LCs and HCs were PCR amplified with the LC -F/-R 
and HC -F/-R primers from two plasmids previously 
constructed and containing the sequences of each chain 
isolated from the hybridoma cDNA. 

EMCV IRES and EF-la promoter were PCR amplified 
from pCREL (9) and pCREFlL (10) with primers 
(EMCV -F/-R and EFla -F/-R) inserting SacII (5'-end) 
and Nhel (3^-end) restriction sites. The resulting fragments 
were digested with SacII/Nhel and used in a tri-molecular 
ligation with fragments resulting from XhoI/SadI and 
Nhel/Xhol digestion of pAl thus replacing the 3'SS 
between the cistrons by the IRES (pCI) or the EF-lot 
promoter (pCE). 

For stable transfection experiments, a cassette 
containing the VEGF-IRES-A and the neomycin coding 
sequence was inserted downstream of the HC ORF. Both 



elements were PCR amplified from pCRA121mSpHA (11) 
and pCDNA3 (Invitrogen), with primers IRES-VEGF-F/ 
overlap-R and overlap-F/Neo-R, then overlapped. 
Further cloning of this cassette was performed using the 
EcoRV/Bglll digested fragment in a tri-molecular ligation 
with Bglll/Pvul and PvuI/EcoRV digested fragments of 
different plasmids: pAl, mutants pA2 to pA9 (described 
below), pCI and pCE. Resulting plasmids were called 
pAx-s, pCI-s and pCE-s. 

Construction of mutants for the 3^SS 

Degenerated oligonucleotides mut-Vl-F and mut Vl-R 
were used to create mutants on three bases of the second 
3'SS in the pVl vector. Equimolar amounts of each 
oligonucleotide were mixed and hybridized, leading to 
a short double stranded DNA fragment with cohesive 5'- 
and 3 '-ends corresponding to sequences for NotI and Pad 
restriction sites. Different dilutions of the hybridization 
product were cloned into the Notl/PacI digested pVL 
More than 200 clones were analyzed and DNA sequences 
were determined (Millegen sequencing services) in order to 
isolate different mutants for the second acceptor splice 
site. Eight mutants, called pV2 to pV9, were chosen for 
further experiments. 

Construction of the first 3'SS mutants was performed by 
inserting point mutations in pAl using the Quikchange 
XL site directed mutagenesis kit (Stratagene) according 
to the manufacturer's instructions with the primers 
couples listed from MUT-A2 -F/-R to MUT-A9 -F/-R, 
Resulting plasmids were called pA2 to pA9. 

Ceil culture and transfection 

CHO cells (ATCC CCL61) were cultivated in EMEM 
medium (Gibco), HeLa (ATCC CCL2) and NIH-3T3 
(ATCC CRL-1658) cells in DMEM containing 1 g/1 
glucose (Gibco) and HEK 293T cells (GenHunter, Q401) 
in DMEM containing 4.5 g/l glucose (Gibco). All media 
were supplemented with 10% FCS, 1% glutamine and 
antibiotics. All cell types were cultivated at 37°C in a 
humidified 5% CO2 atmosphere. For transient trans- 
fection experiments, cells were seeded either onto six- 
well dishes or 10 cm plates 24 h prior to transfection. 
CHO cells were transfected with Fugene-6 transfection 
reagent (Roche) according to the manufactixrer's 
instructions. HeLa, NIH-3T3 and HEK293T ceUs were 
transfected using the JetPEI transfection reagent 
(Polyplus Transfection) according to the manufacturer's 
instructions. For stable transfection, CHO cells were 
transfected as described earlier with plasmids pAx-s, 
pCI-s and pCE-s and treated with 0.75mg/ml G418 
(Euromedex) 48 h after transfection. Surviving cells were 
pooled and amplified for 25 more days with continuous 
G418 selection. 

Western blot analysis 

Twenty four hours after transfection, cells were collected 
and sonicated in 50jLLl of sodium dodecylsulfate (SDS) 
sample buffer. Protein concentration in the cell lysates 
was determined using the bicinchoninic acid method 
(Interchim). Samples were boiled at 95°C for 5min after 
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addition of reducing agents and 30|j,g of total proteins 
were separated on a NuPAGE 4—12% Bis-Tris gel 
(Invitrogen). After electrophoretic transfer, the nitro- 
cellulose membranes (Schleicher & Schiiell) were blocked 
with 3% skimmed milk. Tagged luciferases were immuno- 
detected using mouse monoclonal anti-HA (dilution 
1 : 1000) (Babco) as a primary antibody and peroxidase- 
conjugated sheep anti-mouse (dilution 1 : 10000) 
(Amersham) as a secondary antibody and the ECL 
detection kit (Amersham). HCs and LCs of the anti-N- 
VEGF antibody were detected using a peroxidase- 
conjugated sheep anti -mouse IgG antibody (dilution 
1 : 2500) (Pr omega). Normalization was done using a 
monoclonal anti p-actin antibody (dilution 1 : 10000) 
(Sigma- Aldrich). Densitometric analysis was performed 
using Scion Image 1.63 software (Scion Corporation). 

Total and polysomal RNA isolation and RT-PCR analysis 

Sucrose gradient fractionation was performed essentially 
as described earlier (12). Cell extracts were prepared 
by lysis at 4"C in extraction buffer (10 mM Tris-HCl, 
pH 8,0, 140 mM NaCl, 1.5 mM MgClz, 0.5% Nonidet- 
P40 and 500U/ml RNAsin), and nuclei were removed 
by centrifugation (12000^, 10 s, 4°C). The supernatant 
was supplemented with 20 mM dithiothreitol, 150|ig/ml 
cycloheximide, 665 |xg/ml heparin and 1 mM phenyl- 
methylsulfonyl fluoride and centrifuged (12 000g, 5min, 
4°C) to eliminate mitochondria. The supernatant was 
layered onto a 5 ml hnear sucrose gradient (15^0% 
sucrose [w/v] supplemented with lOmM Tris-HCl, pH 
7.5, 140 mM NaCl, L5mM MgCls, 10 mM dithiothreitol, 
lOOjLig/ml cycloheximide and 0.5mg/ml heparin) and 
centrifuged in a SWSO.lTi rotor (Beckman) for Ih 
40min at 140 000^ and at 4°C, without brake. Fractions 
of 300|Lil were collected and digested with 100|.ig pro- 
teinase K in 1% SDS and 10 mM EDTA (30min, 37=C). 
RNAs were then recovered by phenol-chloroform-isoamyl 
alcohol extraction, followed by ethanol precipitation. 
Finally, the fractions containing the mRNA, were pre- 
cipitated with 2M LiCl on ice at 4°C overnight- After 
centrifugation (12000^, 15min at 4°C), pellets were 
washed with 70% ethanol pre-stored at -20°C, air dried 
and resuspended in appropriate volumes of RNAse-free 
water. 

Total RNA was extracted using the TriZol reagent 
protocol (Invitrogen). After DNAse treatment (DNA 

free, Ambion) and quantification, reverse transcription 
was done with 2 \ig of total RNA using the First strand 
cDNA Synthesis kit (Invitrogen) and random hexamers. 
The resulting cDNA fragments were PCR amplified using 
three primers allowing simultaneous amplification of the 
three different mRNA species, i.e. spliced on the first 3'SS, 
spliced on the second 3'SS and unspliced (i.e. primers RT- 
CMV-F, RT-LucR-R and RT-LucF-R for the luciferases 
sequences or RT-CMV-F, RT-LC-R and RT-HC-R for 
the antibody's sequences). PCR products were loaded on 
a 2% agarose gel and visualized using SYBR Safe DNA 
gel staining (Invitrogen). Control reactions were per- 
formed using P-actin control primers supplied in the 
Superscript III RT kit (Invitrogen). Cryptic splice sites 



were identified after cloning of the corresponding cDNA 
fragments purified from agarose into the pCR4-TOPO 
(Invitrogen), and sequencing with appropriate primers 
(T3 and T7). Alignment of the resulting sequences with 
the cDNA wild-type sequence allowed identification of 
aberrant splicing events. Corresponding cryptic splice 
sites were inactivated by inserting point mutations using 
the Quikchange site directed mutagenesis kit (Stratagene) 
according to the manufacturer's instructions with primers 
couples listed from QKSCl -F/-R to QKSC5 -F/-R. 

Detection of secreted anti-N-VEGF antibody in the 
culture supernatants 

Microtiter plates were coated overnight at 25°C with 
1 p,g/ml of recombinant N-VEGF diluted in phosphate 
buffered saline (PBS). After three washes with 250 |il of 
washing solution (PBS/0,05% Tween 20), plates were 
blocked with PBS containing 2% (w/v) skimmed milk 
for 2h at 25''C and washed three times. Cell-free super- 
natants (different dilutions) were incubated for 2h at 
25" C. After extensive washing with PBS/T, the detection 
antibody (i.e. goat anti-mouse IgG, HRP conjugated; 
Promega) was diluted at 166ng/ml in PBS/T and 
incubated for 2h at 25°C. Further washes were followed 
by the addition of 100).il of TMB substrate solution (BD 
OptEIA) and the reaction proceeded at 25"C in the dark 
for up to 30min. The reaction was then stopped by the 
addition of 50 |il of 2N H2SO4 and the OD450 was 
measured immediately using a microplate reader 
(UVM340, Asys). Each experiment was repeated at least 
three times. Graphs represent the amount of secreted 
antibody for each mutant transfected relatively to the 
amount secreted with the pAl vector for transient 
transfection experiments and relatively to the amount 
secreted with the pCE-s vector for stable transfection 
experiments. Errors bars represent standard deviation 
and asterisks depict statistically significant differences 
compared to pAl or pCE-s (**P<0.01 and *P<0.05, 
ANOVA test). 

RESULTS AND DISCUSSION 

The aim of this study was to evaluate if alternative splicing 
could be a suitable mechanism to generate different ratios 
of expressed recombinant proteins from a bicistronic 
vector. 

Evaluation of the efficiency of alternative splicing as a 
bicistronic mode of expression 

In a first set of experiments, we wanted to test whether 
alternative splicing could lead to the co-expression of two 
proteins encoded by two cistrons in the same vector. For 
that purpose, we first elaborated a plasmid, called VI, 
comprising a complete intron in the 5'-UTR and an 
additional consensus acceptor splice site (3'SS) between 
the two cistrons (Figure lA). The intron is constituted 
by consensus elements: a donor splice site (5'SS), a 
branch point, a pyrimidine tract and a 3'SS. The construc- 
tion was carried out with the Renilla Luciferase (Luc R) 
and the Firefly Luciferase (Luc F) as reporter genes. 
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Figure L Development of a bicistronic vector based on alternative 
splicing. (A) Schematic representation of the bicistronic vector based on 
alternative splicing (pVl) and the corresponding matures niRNAs. The 
two cistrons consist of the HA-tagged Renilla Luciferase (Luc R) and 
Firefly Luciferase (Luc F), ml corresponds to mRNA spliced on the 
first 3'SS leading to expression of Luc R, m2 corresponds to mRNA 
spliced on the second 3'SS leading to expression Luc F and m-us 
corresponds to unspliced mRNA. (B) An anti-HA antibody was used 
to perform western blot analysis on protein extracts from pVl transiently 
transfected CHO cells. The upper band (61 kDa) corresponds to 
HA-LucF (second cistron) and the lower band (37 kDa) to HA-LucR 
(first cistron). Control (Ctrl) corresponds to mock- transfected cells. 

Although expression of these proteins is usefully followed 
through their enzymatic activities, we wanted to evaluate 
their respective concentrations by western blotting. For 
this goal, we fused the HA tag to their amino-terminal 
ends. Consequently, we replaced their original start 
codon by a consensus AUG leading to a very similar 
initiation of translation (Figure lA). Theoretically, 
transcription of the expression cassette can be followed 
by two distinct types of splicing events (resulting from 
the use of either the first or the second 3'SS) thus 
generating a first mature mRNA allowing Luc R 
expression (ml) and a second mRNA encoding Luc F 
(m2); whereas a third possibility could be generation of 
the unspliced mRNA (m-us) (Figure lA). 

A western blot analysis performed with lysates of CHO 
cells transiently transfected with the VI plasmid revealed 
that the two luciferases were detected but HA-LucF 
(61 kDa) was much more expressed than HA-LucR 
(37 kDa) (Figure IB), Our interpretation was that the 
second 3'SS was predominantly used by the splicing 
machinery compared with the first one. Repeated experi- 
ments also indicated that this ratio was very constant. 

Modulation of the ratio between the two luciferases 

In order to control the ratio between the two different 
mRNAs (and consequently the expression of the two 



luciferases), our strategy was to weaken efficiency of 
the second 3'SS by a directed mutagenesis approach. 
The —3, —5 and —11 positions of the consensus 
sequence (Figure 2A) were selected for mutagenesis for 
the following reasons: the —3 and —5 positions are the 
most conserved (apart the absolute AG) and the —11 is 
central in the polypyrimidine tract and could be changed 
into purines. All mutated constructs (53 mutants obtained 
out of the 64 possibilities) were tested for transient 
transfection in CHO cells followed by western blot 
analysis. Results revealed a large panel of Luc F/Luc R 
ratios between the different mutants. From these data, 
eight mutants that were representative of the whole 
range of expression ratio possibilities were chosen for 
further experiments. These mutants, whose second 3'SS 
sequences are provided in Figure 2A, were called V2 to 
V9. When used to transfect CHO cells, the different 
constructs generated important variations in the relative 
expression of the two luciferases. This was evaluated by 
measuring their enzymatic activities (Supplementary 
Figure SI). Since the specific activities of the two 
luciferases are not clearly established, we also determined 
their expression by western blotting (Figure 2B). Different 
ratios between HA-LucF and HA-LucR quantities can be 
observed from a large majority of HA-LucF to a large 
majority of HA-LucR and including intermediate ratios 
(e.g. ratio close to 1:1). In agreement with the proteins 
expression pattern, semi-quantitative-competitive reverse 
transcriptase (RT) PCR analyses of the different mRNA 
species of the same samples revealed an increased amount 
of RNA spliced on the first 3'SS (ml, Luc R) and a 
decrease in the amount of RNA spliced on the second 
3'SS (m2, Luc F) from mutants V2 to V9 (Figure 2C). 

Additionally, this RT-PCR analysis also showed that a 
significant part of the mRNAs were not spliced (*m-us' 
band). This mRNA could potentially be translated to 
generate HA-LucR. It was also interesting to point out 
the presence of additional PCR products distinct from 
the three described earlier. After cloning and sequencing 
of these bands, two cryptic 5'SS within the Luc R sequence 
were identified and it is worth noting that they were not 
predicted as putative splice sites in silica. 

To assess if this alternative splicing mechanism was 
comparable among cells lines of different origins, the set 
of plasmids was tested, in parallel, in CHO (hamster), 
HeLa (human) and NIH-3T3 cells (mouse). Results 
clearly showed that all three cell types presented similar 
patterns of expression ratios between the two luciferases 
(Figure 2D). Taken together, these results tend to indicate 
that (i) the system developed here allows simultaneous 
expression of two proteins, (ii) easily permits the 
modulation of the ratio between the two proteins and 
(iii) could be used in various cell types with equal 
efficiency. 

Expression of the two chains of an antibody 

After vaUdation of the vector's functionality with two 
reporter genes, the construction was used to express 
a particular heteromultimeric protein consisting in an 
equimolar ratio of two polypeptides, i.e. an antibody. 
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Figure 2. Mutations in the sequence of the second 3'SS allows efficient modulation of the expression ratio between the two luciferases. (A) Mutants 
of the second 3'SS listed from VI to V9 and their corresponding sequences. VI corresponds to the consensus splice site sequence. Mutated bases are 
bolded and the dinucleotide corresponding to the 3' border of the splice site is underlined. The first dinucleotide of exon 2 is in italic. Positions —3, 
—5 and —11 that are chosen to be mutated are indicated under the VI sequence. (B) Western blot analysis on protein extracts from CHO cells 
transiently transfected with the 3'SS mutants listed in Figure 2A. Ctrl corresponds to mock-transfected cells. In the lower panel, dcnsitometric data 
from western blot X-ray films are used for graphical representation of the Luc F/Luc R ratio. (C) RT-PCR analysis on mRNAs extracted from CHO 
cells transiently transfected with the 3'SS mutants pVl to pV9. Competitive PCR amplification was performed using a forward primer hybridizing 
upstream the intron 5'SS and two reverse primers hybridizing from bases 186 to 206 within the Luc R ORE and from bases 133 to 152 within the 
Luc F ORF, thus allowing simultaneous detection of the three mRNA species: m-us, ml and m2, (D) Graphical representation of densitometric data 
calculating Luc F/Luc R expression ratios obtained with the different constructs in CHO, HeLa and NIH-3T3 cell lines. 



Previous studies have demonstrated that the ratio from 
LC to HC is crucial for correct antibodies folding and 
assembly (2). This ratio also influences the amount of 
functional secreted antibodies. The main problem is that 
the optimum ratio depends on the sequence of the 
antibodies, the cell type used for expression and whether 
production is performed in a stable or transient context 
(4,5,13). Consequently, for each antibody to be produced 
in a particular condition, the optimum ratio should be 
established. Furthermore, this ratio should be stable, 
especially when cell lines are created. For these reasons, 
the system developed in this study could be suitable for the 
expression of recombinant antibodies because of its 
adaptability. Indeed, a panel of mutants generating 
various expression ratios between the two chains can be 
easily created, thus allowing identification of the best 
construct for any given antibody, in any cell type chosen 
and in any condition. 

To test the system, we chose a home-made monoclonal 
antibody directed against human N-VEGF, an intra- 
cellular protein that results from the cleavage of the 
L-VEGF precursor that also generates the secreted 
VEGF-A molecule (14) (Figure 3A). The cDNAs 
encoding the two chains of this antibody were cloned and 
sequenced. The LC and HC cDNAs were sub-cloned in the 
bicistronic vector comprising the consensus sequences 
for the different splice sites, respectively as the first and 
second cistron, leading to the pAl construct (Figure 3B). 
A western blot analysis performed on a pAl transfected 



CHO lysate showed that only free LC was detectable 
(data not shown). This indicated that, in this case, 
expression of the first cistron was higher than expression 
of the second cistron, contrary to what was observed 
with the pVl construct. In agreement with knowledge 
in the field of alternative splicing mechanisms (8), this 
change in preferential 3-SS usage could be explained by 
the presence of putative regulatory sequences like ESE 
or ESS that generally He within the coding regions. It 
may thus be explainable that replacing HA-LucR by 
the LC of an antibody may have such an effect on the 
splicing pattern. 

In order to modulate the expression ratio between 
the LCs and HCs, we then chose to mutate the first 
acceptor site. 

Elimination of cryptic splice sites 

Because we had to weaken the strength of the first 3^SS, a 
prerequisite was to identify putative cryptic 3'SS within 
the sequence of the LC encoding cistron. As seen with 
the study on luciferases, such sites can be used by the 
spHcing machinery and thus generate aberrant splicing 
events that could potentially lead to the expression of 
truncated proteins (Supplementary Figure S2A). In 
order to identify these cryptic splice sites, an RT~-PCR 
analysis was performed on mRNAs extracted from pAl 
transfected CHO cells. As expected, this analysis revealed 
the presence of the ml transcript (spliced on the first 3''SS 
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Figure 3. Use of the bicistronic vector for production of a murine recombinant antibody anti-N-VEGF, (A) Schematic representation of the 
precursor L-VEGF molecule which cleavage generates secreted VEGF-A and intracellular N-VEGF. (B) Schematic representation of the bicistronic 
vector constructed for antibody anti-N-VEGF expression (pAl). The LC was cloned as the first cistron and the HC as the second cistron. 
(C) Sequence of the anti-N-VEGF LC constant region and cryptic splice sites of the variable region. The signal peptide is showed in the black- 
lined box, followed by the variable region of the LC. The arrow delimits the variable region (upstream) and the constant region (dow^nstream). 
Identified cryptic 5' (GT) and 3'SS (AG) are underlined and mutations are represented in bold above the sequence. 



and translated into LC) and the complete absence of the 
m2 transcript (spliced on the second 3'SS) (Supplementary 
Figure S2B). However, several other species of inter- 
mediate sizes were detected, most of them quite intense. 
This tends to indicate that many abnormal splicing events 
frequently occurred, generating an important pool of mis- 
spliced transcripts. After cloning and sequencing of 
corresponding cDNAs, several cryptic splice sites were 
identified. Most of them were 3'SS but we also identified 
cryptic 5'SS, One of them, a donor site located three bases 
upstream the LC STOP codon, was found to splice with 
the second constitutive acceptor site in almost 90% of the 
atypical spliced transcripts. Surprisingly, many of the 
identified sites were 3'SS despite the lack of a clearly 
defined pyrimidine tract „ The sites were mutated with 
respect to the amino-acid sequence of the encoded 
polypeptide. As new cryptic splice sites may be activated 
if a mutation alters or removes a genuine nearby site, the 
experiment was repeated three times after each round of 
identification/mutation, and we clearly observed that 
aberrant splicing was considerably lowered after each 
step (data not shown). 

Since we planed to mutate the first constitutive 3'SS to 
weaken its efficiency, it was of importance to test the 
presence of remaining cryptic splice sites under conditions 
of poor efficient intron 3'SS. In this goal, two mutatits 
corresponding to variants V5 and V9 were generated. 
As expected, the analysis revealed activation of nevv^ 
cryptic splice sites which all were corrected. Finally, 10 
3'SS and 5 5'SS were identified and mutated, as shown 
in Figure 3C. 

Analysis of the mRNA profile of pAl transfected CHO 
celis (that contained the LC mutated for all identified 
cryptic splice sites) revealed two major products 



corresponding to the expected spliced niRNAs and no 
extra bands. In these conditions, the ml species was 
clearly more abundant than the ml mRNA 
(Supplementary Figure S2C), 

Modification of the ratio between the two chains of the 
antibody 

In order to modify the ratio between the two different 
mRNAs (and consequently control the relative 
expression of the LCs and HCs) we mutated the 
sequence of the first acceptor splice site. The sequences 
chosen were those of the 8 mutants described earlier and 
mentioned as V2 to V9. These mutants (called A2 to A9) 
were used to transfect CHO and HEK293T cells, two cell 
types typically used for recombinant antibody expression 
because of easy adaptation to suspension and serum-free 
culture conditions, high transfection efficiency and 
high production rates (15). As described earlier with 
the luciferases system, mutants of the acceptor splice 
sites allowed efficient modulation and displayed a wide 
range of LC to HC mRNA ratios both in CHO (Figure 
4A) and HEK cells (Figure 4C). As expected, these 
variations directly correlated with the amount of func- 
tional secreted antibody (determined by antigen-coated 
ELISA) in both cells types as shown in Figures 4B and 
4D. Indeed, for CHO cells, mutants A6 and A8 showed 
significant increase in the level of secreted antibodies 
compared to Al (Figure 4B), mutant A8 being the best 
producer with a 4-fold increase. In HEK cells, the same 
two mutants also led to the best productivity allowing a 
2-fold increase in the level of secreted antibodies. These 
results are in accordance with western blot analysis of 
secreted antibodies (Supplementary Figure S3A). As 
controls, we demonstrated that LC alone could be 
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Figure 4. Effect of the different 3'SS mutants on expression efficiency in transiently transfccted CHO or HEK293T cells lines. CHO (A and B) and 
HEK293T (C and D) cells were transfected with different mutants of the first 3'SS from pA2 to pA9. (A and C) upper panels: RT-PCR analysis on 
mRNAs. Competitive PGR amplification was performed using a forward primer hybridizing upstream the intron 5'SS and two reverse primers 
hybridizing from bases 419 lo 458 of the LC ORF and from bases 303 to 324 of the HC ORF. (A and C) lower panels: western blot analysis on 
protein extracts from CHO (A) and HEK293T (C) cells transfected with the different 3'SS mutants. Anti p-actin was used to control protein loading. 
(B and D) Amount of functional antibodies secreted by CHO (B) and HEK293T (D) cells transfected with the different mutants pA2 to pA9 in 
comparison to pAl containing the consensus sequences for all splice sites, determined by antigen specific ELISA. Data are presented as 
means ±SEM. Statistical significance was evaluated using one-way ANOVA test, *P<:0.05, **P<O.0L 



efficiently secreted and form dimers, whereas HC alone 
could not be secreted as expected (16,17) (Supplementary 
Figure S3B). Obviously, we confirmed that single chains 
were not able to recognize the antigen in the ELISA test 
(data not shown). 

In conditions of transient transfection, it is not evident 
to determine an optimal ratio between the LC and HC 
encoding mRNAs since mutants A6 and A8 gave the 
best results and present reciprocal mRNA ratios. How 
inverse ratios could provide a comparable production 
of antibody is not really understood but similar results 
were provided by others (4). However, it is clear 
that extreme imbalanced ratios were deleterious for 
antibody production. Most of the mutants showed the 
same efficiency when compared between CHO and 
HEK293T. One major difference should be underlined: 
the unspliced forin was more present in HEK293T than 
in CHO cells. This could be interpreted as a less efficient 
recognition of the 5^SS in HEK293T cells. This fact could 
be of general importance since most expression vectors 
possess the same kind of intron in their 5'-UTR, Thus, it 
could be of interest to optimize the efficiency of the 5'SS, 
especially for its use in these cells. 

Study of mRNA species engaged in polysomes 

Since a significant part of the mRNAs was unspliced, 
especially in HEK293T transfected cells, we tried to 
evaluate the translation efficiency of this species that 



have a larger 5^-UTR than spliced mRNAs. This study 
was performed by polysome-associated RNAs assay 
(Figure 5A). For this purpose, HEK293T cells were 
transfected with the A6 mutant and polysome fractio- 
nation was done. Different fractions from the sucrose 
gradient were collected and associated mRNAs corres- 
ponding to species from free mRNAs to mRNAs associa- 
ted with poly-ribosomes were isolated (Figure 5B). 
Fractions were pooled into four distinct groups and 
analyzed by RT-PCR to detect the different species. As 
shown in Figure 5C, mRNA spliced on the first 3'SS, 
mRNA spliced on the second 3'SS but also unspliced 
mRNA are present in all the fractions. This indicates 
that the three species can be efficiently translated 
(fractions 3 and 4), including unspliced mRNA that 
generates the LC. It could be noticed that the ratio 
between unspliced/spliced mRNAs is lower in translated 
mRNAs (mostly fractions 3 and 4) than in untranslated 
mRNAs (fraction 1) or total RNA (Figure 5C, lane T). 
This reinforces a previous conclusion about the need 
to improve efficiency of the intron 5^SS recognition 
since, even if the unspHced form can be partially 
translated, it represents a loss in the efficiency of the 
total mRNA to produce full antibodies. However, an 
important part of transcribed mRNAs remained in 
fractions 1 and 2 suggesting that translational machinery 
may be saturated in these cell culture conditions and that 
increasing the cells translational capacity could be a way 
to improve their productivity. 
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Figure 5, Study of mRNA species engaged in polysomes. (A) Schematic 

representation of polysomal and free mRNAs distribution after sucrose 
density gradient RNA fractionation of pA6 transfected CHO cells. 
(B) RNAs purified from each fraction, collected from top to bottom of 
the sucrose density gradient, were visualised on agarose gel by SyBRgreen 
staining. The fractions were pooled in four samples representing the 
different stages of mRNAs commitment in translation. (C) RT-PCR 
analysis was performed on mRNAs extracted from the four samples 
obtained after polysomes fractionation. To exclude any DNA 
contamination, PCR reactions were performed on the four RNA fractions 
lacking the Reverse Transcriptase step (— RT). Tlie control T corresponds 
to the RT-PCR process with total RNA, i.e. before polysomes 
fractionation, and Ctrl to PCR amplification with pAl as the matrix. 



Comparison of the vector based on aiternatfve splicing to 
other constructs in stably transfected cells 

As many processes for recombinant antibody production 
involve the use of stable cell lines, we finally wanted to 
investigate if the vectors based on alternative splicing 
could be successfully utilized in such conditions. 
Expression cassettes including VEGF I RES- A and the 
gene coding for neomycin were cloned downstream of 
the HC in pAl to pA9 constructs (Figure 6A) to allow 
selection of the stable transfectants under G418 treatment. 
Two additional constructs were generated in order to 
compare, in this context, the vectors based on alternative 
splicing to other types of vectors, classically used for 
recombinant antibody production, i.e. a vector with 
an IRES and a vector with two different promoters 
(Figure 6A). For the vector containing the IRES 
between the two cistrons, we chose the EMCV IRES, 
which is one of the most efficient IRES in a great 
number of cell types and therefore the most commonly 
used for expression of recombinant antibodies (18,19). 
For the vector in which transcription of the HC is under 
a separate promoter, we chose the EE I a promoter which 
was successfully used in vectors for IgG mammalian 
expression (20). 
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Figure 6. Efficiency of alternative-splicing based vectors in CHO stable 
transfectants and comparison to other consU-ucts. (A) Schematic 
representation of the different vectors used to generate stable pools of 
CilO transfectants: bicistronic vectors based on alternative splicing 
(pAx-s), IRES-containing bicistronic vector (pCI-s) and double- 
promoter vector (pCE~s). (B) RT-PCR analysis of mRNAs extracted 
from CHO cells stabiy transfected with the dift'erent 3'SS mutants 
pAl-s to pA9-s. (C) Amount of functional antibodies secreted by 
pools of CHO cells stably transfected with mutants pAl-s to pA9-s 
in comparison to the IRES-containing bicistronic vector (pCI-s) and 
the bigenic vector (pCE-s). Data are presented as means ±SEM. 
Statistical significance was evaluated using one-way ANOVA test. 

RT-PCR analysis performed on mRNAs extracted 
from stable pools of cells transfected with vectors pAl-s 
to pA9-s showed a clear excess in the LC mRNA 
from mutants Al-s to A5-s, an excess of HC mRNA for 
mutants A7-s and A9-s and a more balanced ratio for 
mutants A6-s and A8-s (Figure 6B). 

In order to test the stability of the splicing efficiency 
throughout the cell divisions, a RT-PCR experiment 
was performed on mRNAs extracted from cells at 
passages 2, 5, 10 and 20. The profiles obtained for three 
representative mutants (A4, A7 and A8) clearly indicate 
that the ratio between the different mRNA species was 
stable throughout passages (Supplementary Figure S4). 

As shown in Figure 6C, mutants leading to the best 
productivity were mutants A7-s and A9-s, indicating 
that, in a stable context, an excess for HC mRNA 
tended to be more favourable for antibody productivity. 
Previous results have also reported that an excess of HC 
mRNA or polypeptide can lead to a more efficient IgG 
production in HEK or CHO cells both in transient (4,5) 
or stable transfection (3). The fact that the best producers 
were different in a transient versus a stable context might 
be interpreted with potential differences in the splicing 
efficiency in the two contexts, thus leading to shght 
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differences in the ml/m2 ratio. Another explanation could 
be, as it has been described by Schlatter et ai (4) that 
the optimal ratio between the two mRNAs is different 
depending on whether production is performed in 
transiently or stably transfected cells. 

Figure 6C also showed that, compared to the two 
'reference' constructs, pCE-s and pCI-s, some mutants 
of the vector based on alternative splicing, mutants A7-s 
to A9-S, allow secretion of a greater amount of functional 
antibodies. This clearly indicates that vectors based on 
alternative splicing can be an interesting alternative to 
conventionally used vectors. Moreover, only a restricted 
number of 3^SS mutants have been tested here. One can 
imagine that other mutations might allow to control more 
tightly the ratio between the two chains and thus permit 
even greater antibody productivity. 

In accordance to studies performed with other 
antibodies sequences in different expression systems, our 
results emphasize the fact that the optimum ratio for 
maximal antibody productivity is poorly predictable. 
The system developed in this study overcomes this 
problem by allowing rapid screening of several mutants, 
generating different possibilities of ratios between the two 
chains. In addition, our results indicate that if an excess of 
HC encoding mRNA is favourable for antibody secretion 
when expressed after permanent transfection, it seems 
that no clear rule can be established for transient 
transfection. In this last case, an empirical approach 
consisting of testing several possibilities remains 
appropriate. However, finding the best combination for 
each antibody to be expressed in a chosen cell type 
could be easily achieved. Moreover, this method, 
involving the use of only one plasmid, seems more 
reproducible than co -transfection of two plasmids in 
various ratios (4) because this technique could easily be 
biased by (i) various transfection efficiencies in transient 
experiments, (ii) extrapolation for stable transfections and 
(iii) chromosomal insertion sites of the two plasmids 
because these sites have a major effect on expression 
efficiency (21). Using a single vector with two independent 
genes generates invariable expression ratio between the 
two proteins but also (i) two identical promoters may 
lead to recombination events in the case of stable 
expression, (ii) different promoters could be perturbed 
by transcriptionnal interference or suppression events 
(1). In addition, plasmids with the EMCV IRES have 
also been successfully used to generate different mutants 
expressing various ratios of HC to LC (5). However, 
Mizuguchi et at. (22) showed that the translation efficiency 
of the IRES can only reach maximum 60% of the cap- 
dependent translation efficiency of the first cistron. On the 
opposite, our system clearly allows a larger panel of ratios 
from 10: 1 to 1 : 10, thus permitting extended possibilities 
compared to IRES-driven systems but also to 2A peptide- 
containing systems that only provide a unique ratio (23). 

CONCLUSION 

During the establishment of the Alternative Splicing- 
based Bicistronic (ASB) vector we have provided 



information concerning the presence of cryptic splice 
sites that could be usefully integrated for classical 
vectors that contain an intron in their 5^-UTR. In the 
particular example of antibody production, this work 
would facilitate further production of other murine 
antibodies, since the majority of the identified cryptic 
splice sites (11/15 identified sites) are located in the 
sequence that encodes the constant region of the LC. 
Indeed, simple cloning of different variable regions in 
frame with the previously mutated constant region can 
be done allowing an easier and faster research for 
additional cryptic splice sites and further optimize 
antibodies production efficiency. 

We provide here a novel system to elaborate bicistronic 
vectors that present several advantages in addition to its 
main interest which is the possibility to control the ratio of 
two expressed proteins: (i) the expression and the ratio are 
stable and very similar for several cell lines for both 
transient and stable transfection (ii) the vector is small 
and easy to manipulate, (iii) the vector could be easily 
adapted to study ESE or ESS. 

Further modifications could be performed in order to 
optimize the production yield. Finally this system might 
be easily adapted for expression of several heteromeric 
proteins for which the ratio between two subunits has to 
be tuned or for expression of two distinct proteins with 
synergistic effects at variable ratios, e.g. in gene therapy 
experiments. 
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